Extraction de PCFG et analyse de phrases pré-typées (PCFG Extraction and Pre-typed Sentences Analysis) [in French]
نویسنده
چکیده
PCFG Extraction and Pre-typed Sentences Analysis This article explains the way we extract a PCFG from the Paris VII treebank. Firslty, we need to transform the syntactic trees of the corpus into derivation trees. The transformation is done with a generalized tree transducer, a variation of the usual top-down tree transducers, and gives as result some derivation trees for an AB grammar. Secondely, we have to extract a PCFG from the derivation trees. For this, we assume that the derivation trees are representative of the grammar. The extracted grammar is used, via the CYK algorithm, for sentence analysis. MOTS-CLÉS : Extraction de grammaire, grammaire de Lambek, PCFG, transducteur d’arbre, algorithme CYK.
منابع مشابه
PCFG Extraction and Pre-typed Sentence Analysis
We explain how we extracted a PCFG (probabilistic contextfree grammar) from the Paris VII treebank. First we transform the syntactic trees of the corpus in derivation trees. The transformation is done with a generalized tree transducer, a variation from the usual top-down tree transducers, and gives as result some derivation trees for an AB grammar, which is a subset of a Lambek grammar, contai...
متن کاملMicrowave Assisted Extraction of Olive Oil Pomace by Acidic Hexane
In this study, Microwave-Assisted Solvent Extraction (MASE) was used to recover oil residues from pomace olive using acidic hexane. Results obtained demonstrated that oil extraction yield increased with time, the amount of acetic acid in hexane and power radiation. For both radiation powers used (170 and 510W), the optimal extraction time and most interesting content of acetic acid in hexan...
متن کاملExtraction of Drug-Drug Interaction from Literature through Detecting Linguistic-based Negation and Clause Dependency
Extracting biomedical relations such as drug-drug interaction (DDI) from text is an important task in biomedical NLP. Due to the large number of complex sentences in biomedical literature, researchers have employed some sentence simplification techniques to improve the performance of the relation extraction methods. However, due to difficulty of the task, there is no noteworthy improvement in t...
متن کاملStudying impressive parameters on the performance of Persian probabilistic context free grammar parser
In linguistics, a tree bank is a parsed text corpus that annotates syntactic or semantic sentence structure. The exploitation of tree bank data has been important ever since the first large-scale tree bank, The Penn Treebank, was published. However, although originating in computational linguistics, the value of tree bank is becoming more widely appreciated in linguistics research as a whole. F...
متن کاملSimplification de phrases pour l'extraction de relations (Sentence Simplification for Relation Extraction) [in French]
Sentence simplification for relation extraction Machine learning based relation extraction requires large annotated corpora to take into account the variability in the expression of relations. To deal with this problem, we propose a method for simplifying sentences, i.e. for reducing the syntactic variability of the relations. Simplification requires the annotation of a small corpus, which will...
متن کامل